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WHAT IS CLAIMEP IS : 

(lyO^^r 1. Iaji automated method of designating text, taken from a set of citing 
^^documents, as^ reasons for citing (RPC) that are associated with respective citing 
5 instances of a cited document, the method comprising: 

obtaining contexts of the citing instances in the respective citing documents, each 
context including text that includes the citing instance and text that is near the citing 
instance; 1 

analyzing the content of the contexts; and 
10 selecting, from the citing insrances 1 context, text that constitutes the RFC, based 

on the analyzed content of the contexts. 



2. An automated method of designating text, taken from a set of citing 
documents, as reasons for citing (RFC) associated with respective citing instances of a 
15 cited document, the method comprising; 



inputting text 
dividing the 



irom the citing documents; 

citing documents' text to define paragraphs, and dividing the 



paragraphs to define < entences; 



obtaining com 



E of the citing instances in the respective citing documents, each 
ince that includes the citing instance and at least one sentence 
nee; 
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generating a content word list based on words that are in the citing documents 7 
/contexts; 

calculating for the sentences in the citing documents' contexts, respective content 

scores that are based on frequency counts of content words that are recited in the 

\ ° 
respective sentences; and 

selecting, from the citing documents 1 contexts, the sentences that constitute the 
RFC, based on the calculated content scores. 



10 



3. The metttod of claim 2, wherein the content word generating step includes: 
generating the content word list based on words that are included in the contexts 
of at least two of the citing\documents. 



15 



4. The method o^claim 2, wherein the content word generating step includes; 
generating the content word list based on words that are included both in the cited 
document itself and in the context of at least one citing document. 



20 




5. An automated method for selecting content words from documents, and 
for determining contend stote^/for respective content words that indicate the content 
words* degree of relev^nc^m^nethod comprising: 
associating paragraph^ fro\n the documents; 
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processing text ih the associated paragraphs to eliminate text that conveys little 
about the content of the paragraphs; 

determining common words that are not eliminated by the processing step and 
that are found in plural paragraphs, while tallying content scores that indicate respective 
numbers of paragraphs in which the respective common words are encountered; and 

forming the contentjword list as including the common words linked to respective 
content scores. 



10 



6. The nfethpd oficlairh 5, wherein the paragraph associating step includes: 
pairing paragraphs jSj6n} among documents that cite a cited document. 



15 



20 



7. The method of claim 5, wherein the paragraph associating step includes: 
pairing paragraphs from a\cited document with paragraphs from documents that 

cite the cited document. 

8. The method of claim 5 Wherein the processing step includes: 
removing from the paired paragraphs, noise words that convey little information 

about the content of the paragraphs ; and ' 

stemming the words of the pairfed paragraphs to a length that preserves their 
essential character while eliminating characters that convey little information about the 
word's identity, 
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9. An automated method of finding different morphological forms of a word. 



7. 



comprising: 

inputting a word; and 

stemming the word by \ eliminating any tetters after the letter from the 
beginning of the word, wherein N\i$ a positive integer. 



10 



15 



1 0. The method oiclai m\9, wherein N=>6. 



lethoc 



of scoring sentences in citing documents, to 
of\'he respective sentences to reasons that a cited document 



20 



11. An automate^ 
indicate relevance of content i 
is cited, the method comprising: 

calculating respective initial content scores (iCSs) for the sentences in the citing 
documents, based on the content of the seraences; 

calculating respective distances (Ds) of the sentences in the citing documents 
from respective citing instances of the cited Document; and 

calculating respective content scoites (CSs) for the sentences in the citing 
documents, based on at least the ICSs and thd distances. 
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15 



12. The method ofi claim 1 1, further comprising normalizing the ICSs to form 
normalized initial content scoles (NlCSs) for use by the CS calculation step, by taking 
into account: 

a) numbers of wordk in the respective sentences; and 

b) a largest frequency count in a content word list that includes; 

1) a set of content words that are found in the sentences of the citing 
documents, and 

2) a set of frequency counts linked to corresponding content words in 
the set of content words. 

13. The method of\fl&m y{, further comprising: 

modifying the distanJs^/ 1\ \o form respective modified absolute distances 
(MADs) for use by the CS calciMtioA step, based on criteria relating to predetermined 
statistical observations of the implications of placement of a sentence in the citing 
document relative to the citing instance. 



14. The method of claim 1 3 , wherein the criteria include: 
whether the sentence is in the same Paragraph as the citing instance. 



20 



15. The method of claim 13, wherein the criteria include: 
whether the sentence is located after the citing instance. 
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10 



AnWpara* us for designating text, taken from a set of citing documents, as 
for citing IpRFC) that are associated with respective citing instances of a cited 
document, the apparatus comprising: 

means for Obtaining contexts of the citing instances in the respective citing 
documents, each coiltext including text that includes the citing instance and text that is 
near the citing instance; 

means for analyzing the content of the contexts; and 

means for selecting, from the citing instances' context, text that constitutes the 
RFC, based on the analyzed content of the contexts. 



15 



20 



17. An apparatus for designating text, taken from a set of citing documents, as 
reasons for citing (RFC) associated with respective citing instances of a cited document, 
the apparatus comprising: 

means for dividingl the citing documents' text to define paragraphs, and for 
dividing the paragraphs to define sentences; 

means for obtaining contexts of the citing instances in the respective citing 
documents, each context including: a sentence that includes the citing instance and at 
least one sentence that is neai the citing instance; 

means for generating \a content word list based on words that are in the citing 
documents* contexts; 
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^\/ means for calculating, for the sentences in the citing documents' contexts, 
respective content scores that are based on frequency counts of content words that are 
recited in the respective sentences; and 

means for selecting, from the citing documents' contexts, the sentences that 
constitute the RFC, based on the Calculated content scores. 



10 



18. The apparatus of clajm 17, wherein the content word generating means 
includes: 

means for generating the contek word list based on words that are included in the 
contexts of at least two of the citing documents. 



15 



19. The apparatus of claim 1 1\ wherein the content word generating means 
includes: 

means for generating the content wori list based on words that are included both 
in the cited document itself and in the context or at least one citing document. 



20 



20. An apparatus for selecting content words from documents, and for 
determining content scores irrespective content words that indicate the content words 1 



degree of relevance, the apparatus comprising: 

>pkrae 



means for associating paragraphs from the documents; 
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15 



means for processing Itext in the associated paragraphs to eliminate text that 
conveys little about the contentW the paragraphs; 

means for determining common words that are not eliminated by the processing 
means and that are found in plural paragraphs, while tallying content scores that indicate 
respective numbers of paragraphs in which the respective common words are 
encountered; and 

means for forming the codpnt word list as including the common words linked to 
respective content scores. 



21. The apparatus/^ cl^n 20, wherein the paragraph associating means 
includes: 

means for pairing paragraphs from among documents that cite a cited document. 

22. The apparatus of claim 20, wherein the paragraph associating means 
includes: 

means for pairing paragraphs from a cited document with paragraphs from 
documents that cite the cited document. 




20 



23. The apparatus of claim 20, wherein the processing means includes; 
means for removing from the paired paragraphs, noise words that convey little 
information about the content of the paragraphs; and 
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means for stemming the words of the paired paragraphs to a length that preserves 
their essential character while^liminating characters that convey little information about 
the word's identity. 



5 24. An apparatus of \finding different morphological forms of a word, the 

apparatus comprising: 

means for stemming the Word, the stemming means including: 
means for eliminating an>\ letters after the N Ul letter from ihe beginning of the 

word; 

10 wherein N is a positive ir 




25. The apparatus of claiim^ wherein N=6. 

26. An apparatus of scoring sentences in citing documents, to indicate 
15 relevance of content of the respective sentences to reasons that a cited document is cited, 

the apparatus comprising: 

means for calculating respective initial content scores (ICSs) for the sentences in 
the citing documents, based on the contenuof the sentences; 

means for calculating respective ai stances (Ds) of the sentences in the citing 
20 documents from respective citing instances of the cited document; and 

means for calculating respective content scores (CSs) for the sentences in the 



citing documents, based on at leas: the ICSs and the distances. 
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27. The apparatus of claim 26, further comprising means for normalizing the 
ICSs to form normalized initial content scores (NICSs) for use by the CS calculation 
means, by taking into account;' 

a) numbers of words in the respective sentences; and 

b) a largest frequency count in a content word list that includes: 

1) a set of content words that are found in the sentences of the citing 
documents, and 

2) a set of frequency counts linked to corresponding content words in 
the set of content words. 



28 . The apparatuslof cl 
means for modifying\tWe <Sj 
distances (MADs) for use by the CS 1 



26, further comprising: 

[tances D to form respective modified absolute 
calculation means, based on criteria relating to 



15 



predetermined statistical observations df the implications of placement of a sentence in 
the citing document relative to the citing instance. 



29. The apparatus of claim 28, wherein the criteria include: 
whether the sentence is in the same paragraph as the citing instance. 



20 



30. The apparatus of claim 28, wherein the criteria include: 
whether the sentence is located after the citing instance. 
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31. A computei-readable memory that, when used in conjunction with a 
computer, can cany out a method of designating text, taken from a set of citing 
documents, as reasons forfeiting (RFC) that are associated with respective citing 
instances of a cited documentAhe computer-readable memory comprising: 

computer-readable code for obtaining contexts of the citing instances in the 
respective citing documents, each context including text that includes the, citing instance 
and text that is near the citing instance; 

computer-readable code for analyzing the content of the contexts; and 
computer-readable code for selecting, from the citing instances' context, text that 
constitutes the RFC, based on the analyzed content of the contexts. 



15 



20 



32. A computer-readable meniory that, when used in conjunction with a 
computer, can cany out a method of designating text, taken from a set of citing 
documents, as reasons for citing (RFC) associated with respective citing instances of a 
cited document, the computer-readable memota comprising: 

computer-readable code for inputting text from the citing documents; 

computer-readable code for dividing! the citing documents' text to define 
paragraphs, and dividing the paragraphs to define sentences; 
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computer-readable code for obtaining contexts of the citing instances in the 
respective china documents, each context including: a sentence that includes the citing 
instance and at l^st one sentence that is near the citing instance; 

computer-readable code for generating a content word list based on words that are 
in the citing documents' contexts; 

computer-readable code for calculating, for the sentences in the citing documents' 
contexts, respectivfe content scores that are based on frequency counts' of content words 
that are recited in the respective sentences; and 

computer-rekdable code for selecting, from the citing documents' contexts, the 
sentences that constitute the RFC, based on the calculated content scores. 

33. The ^mputer-readable memory of claim 32, wherein the content word 
generating computer-readable code includes; 

computer-readable code for generating the content word list based on words that 
15 are included in the contexts of at least two of the citing documents. 

34. The compeer-readable memory of claim 32, wherein the content word 
generating computer-readable code includes: 

computer-readable cide for generating the content word list based on words that 
20 are included both in the citeij document itself and in the context of at least one citing 

document. 
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35. A computeiireadable memory that, when used in conjunction with a 
computer, can carry out an automated method for selecting content words from 
documents, and for determining content scores for respective content words that indicate 
the content words' degree of relevance, the computer-readable mernoiy comprising: 
computer-readable codA for associating paragraphs from the documents; 
computer-readable code\for processing text in the associated .paragraphs to 
eliminate text that conveysJittle aoout ihe content of the paragraphs; 

computer-readable coHe^A determining common words that are not eliminated 
by the processing computei^aa^ble\code and that are found in plural paragraphs, while 
tallying content scores that indicateyespective numbers of paragraphs in which the 
respective common words are encountered; and 

computer-readable code for fotrning the content word list as including the 
common words linked to respective content scores. 

36. The computer-readable memory of claim 35, wherein the paragraph 
associating computer-readable code includes: ' 

computer-readable code for pairing pa^graphs from among documents that cite a 
cited document. 

37. The computer-readable memorjf of claim 35, wherein the paragraph 
associating computer-readable code includes: 
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computer-readable code for pairing paragraphs from a cited document with 
paragraphs from documents that cite the cited document. 



38, The computer-readable memory of claim 35, wherein the processing 
5 computer-readable code includes:! 

computer-readable code for removing from the paired paragraphs, noise words 
that convey little information aboulthe content of the paragraphs; and 

computer-readable code foAstemming the words of the paired paragraphs to a 
length that preserves their essential \character while eliminating characters that convey 
10 little information about the word\ identity . 



20 




39. A computer-readable memory that, when used in conjunction with a 
15 computer, can cany out a method of finding different morphological forms of a word, 

the computer-readable memory comprising:^ 

computer-readable code for stemmink the word by eliminating any letters after 
the letter from the beginning of the word, therein N is a positive integer. 



40. The computer-readable memory of claim 39, wherein N-6. 
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41. A computer-readable memory that, when used in conjunction with a 
computer, can cany out an automated method of scoring sentences in citing documents, 
to indicate relevance of content bf the respective sentences to reasons that a cited 
document is cited, the computer-readable memory comprising: 

computer-readable code for calculating respective initial content scores (ICSs) for 
the sentences in the citing documents! based on the content of the sentences; 

computer-readable code for calculating respective distances (Ds) of the sentences 



in the citing documents from respective citing instances of the cited document; and 

computer-readable code fei calculating respective content scores (CSs) for the 
10 sentences in the citing documents AbasM/on at least the ICSs and the distances. 



42. The computer-readabiei memory of claim 41, further comprising 
computer-readable code for normalizirL ihe ICSs to form normalized initial content 
scores (NICSs) for use by the CS calculation computer-readable code, by taking into 

15 account: \^ 

a) numbers of words in the respective sentences; and 

b) a largest frequency count in a content word list that includes: 

1) a set of content words tfjat are found in the sentences of the citing 
documents, and 

20 2) a set of frequency counts linked to corresponding content words in 

the set of content words. 
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43 . The computer-readable memory of claim 41, further comprising: 
computer-readable colle for modifying the distances D to form respective 
modified absolute distances (LaDs) for use by the CS calculation computer-readable 
code, based on criteria relating to predetermined statistical observations of the 
implications of placement of a sentence in the citing document relative to the citing 
instance. 




44. The computer-rea&bleWmory of claim 43 1 wherein the criteria include: 
whether the sentence is in the same paragraph as the citing instance. 

45. The computer-readable memory of claim 43, wherein the criteria include: 
whether the sentence is located after the citing instance. 
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